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Abstract Since the 1980s, recombinant human insulin 
for the treatment of diabetes mellitus has been pro- 
duced using either the yeast Saccharomyces cerevisiae 
or the prokaryote Escherichia coli. Here, development 
of the insulin secretory expression system in S. cere- 
visiae and its subsequent optimisation is described. 
Expression of proinsuiin in S. cerevisiae does not re- 
sult in efHcient secretion of proinsuiin or insulin. 
However, expression of a cDNA encoding a proinsulin- 
like molecule with deletion of threonine""' as a fusion 
protein with the S. cerevisiae a-factor prepro-peptide 
(leader), followed either by replacement of the human 
proinsuiin C-peptide with a small C-peptide (e.g. AAK), 
or by direct fusion of lysine^^^ to glycine"^', results in the 
efficient secretion of folded single-chain proinsulin-like 
molecules to the culture supernatant. The secreted sin- 
gle-chain insulin precursor can then be purified and 
subsequently converted to human insulin by tryptic 
transpeptidation in organic-aqueous medium in the 
presence of a threonine ester. The leader confers secre- 
tory competence to the insulin precursor, and con- 
structed (synthetic) leaders have been developed for 
efficient secretory expression of the insulin precursor in 
the yeasts S. cerevisiae and Fichia pastories. The Kex2 
endoprotease, specific for dibasic sites, cleaves the 
leader-insuHn precursor fusion protein in the late secre- 
tory pathway and the folded insulin precursor is secreted 
to the culture supernatant. However, the Kex2 endo- 
protease processing of the pro-peptide-insulin precursor 
fusion protein is incomplete and a significant part of the 
pro-peptide-insulin precursor fusion protein is secreted 
to the culture supernatant in a hyperglycosylated form. 
A spacer peptide localised between the leader and the 
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insulin precursor has been developed to optimise Kex2 
endoprotease processing and insulin precursor fermen- 
tation yield. 



Insulin and diabetes 

The pancreatic peptide hormone insulin was discov- 
ered in 1921-1922 by Banting and Best (Best 1922, 
192.^). In the ^-cells of the pancreatic islets of Lan- 
gerhans insulin is synthesised as prepro-insulin, then 
folded and processed into the globular two-chain 51- 
amino- acid-residue insulin molecule in the secretory 
pathway (Steiner et al. 1967, 1986; Docherty and 
Steiner 1982). The complex process of intracellular 
transport and secretion of insulin from the j8-cells is 
highly regulated. In the Golgi apparatus proinsuiin is 
targeted to secretory granules. Enzymatic conversion 
of proinsuiin to insulin occurs within acidifying se- 
cretory granules where the pro-peptide (C-peptide) is 
removed. Two mammalian functional equivalents of 
the S. cerevisiae Kex2 endoprotease PCI (or PCS) and 
PC2 cleave proinsuiin at dibasic processing sites, re- 
moving the C-peptide, and carboxypeptidase H is 
presumed to remove the two basic amino acid residues 
from tlie B-chain"s C-tenninus. 

Insulin is essential for maintaining glucose homeo- 
stasis and nonnal metabolic regulation. When the 
blood glucose level increases, the jS-cells release insulin. 
Insulin binds to specific receptors in various target 
tissues such as muscle, adipose tissue and liver. InsuHn 
lowers the blood glucose level by increasing glucose 
uptake and subsequent metabolism by tissues such as 
muscle and fat, and by suppressing hepatic glucose 
output. Diabetes mellitus is a metabolic disorder 
characterised by chronic hyperglycaemia, in which 
glucose is not taken up and metabolised normally by 
the cells because of insufficient insulin concentration. 
This may be due to jS-cell destruction (in type 1 dia- 
betes) or to a combination of jS-cell failure and resis- 
tance of the target tissues to insulin action (in type 2 



diabetes). Without effective insulin treatment, type 1 
diabetes leads to coma and death. 

Since the early 1920s it has been possible to treat 
diabetes mellitus with insulin, and for decades this was 
purified from porcine or bovine pancreas. The average 
daily dose for patients treated with insulin is approxi- 
mately 40-60 U, equal to 1.4-2.1 mg of insulin. In the 
'industrialised' part of the world more than 
130,000 mega U (approximately 4600 kg) of insulin are 
used yearly for the treatment of patients with diabetes 
mellitus, and the requirement for insulin is increasing by 
3-4% (in volume) yearly. Approximately 1 50 mg of in- 
sulin can be purified from one porcine pancreas and the 
quantity of insulin needed to satisfy current require- 
ments would be difficult to produce by pancreatic ex- 
traction. Developments in molecular biology and 
biotechnology opened up new possibilities, and in 1987 
genetically engineered human insulin was produced in 
the yeast S. cerevisiae. This simple eukaryote is re- 
markably efficient at expressing and secreting insulin, 
and it offers high productivity. 

The incidence of diabetes is increasing rapidly and it 
is estimated that, by the year 2025, the number of dia- 
betic patients worldwide will increase from 120 million 
to approximately 300 million. Only a minor fraction 
(approximately 10%) of these will have type 1 diabetes, 
dependent on insulin for survival. The vast majority will 
have type 2 diabetes. Although only relatively insulin- 
deficient and therefore not dependent on insulin for 
survival, many of these patients will progress to a state 
that requires insulin treatment for metabolic control. 
Consequently, the pharmaceutical requirement for in- 
sulin is increasing steadily, and even the productivity of 
current insulin expression systems is beginning to im- 
pose limitations. Efficient insulin production is also re- 
quired if the development of novel routes for 
administering insulin (such as the intrapulmonary route 
where bio-availability is lower than with subcutaneous 
administration) is to be both realistic and economically 
feasible. One of the challenges within applied biotech- 
nology is to provide expression systems that satisfy the 
increasing insulin requirements for the treatment of 
diabetes mellitus. Here expression and optimisation 
of insulin secretory expression in the yeast S. cerevisiae 
is described. 



Structure of the insulin molecule 

The primary structure of insulin was determined by 
Sanger et al. (1955). The mature human insulin mol- 
ecule consists of an A-chain and a B-chain, joined by 
two disulfide bonds. In addition, the A-chain has an 
intrachain disulfide bond (A6^A11). Vertebrate insu- 
lins have a high frequency of invariant and conserved 
amino acid residues. The six cysteines and the disulfide 
bond location are invariant in all vertebrate insuHns. 
The three-dimensional structure of insulin hexamers 
has been determined by X-ray crystallography (Adams 



et al. 1969; Blundell et al. 1971; Smith et al. 1984; 
Baker et al. 1988; Derewenda et al. 1989), and the 
three-dimensional structure of some soluble mono- 
meric insulin analogues has been derived by applying 
nuclear magnetic resonance spectroscopy (Ludvigsen 
et al. 1994; Olsen et al. 1996, 1998). The insulin A- 
chain (21 amino acids) features two a-helices (A2-A8 
and A13-A19) joined by a loop (A9-A12). The A- 
chain is joined to the 30-amino-acid B-chain by two 
disulfide bonds (A7-B7 and A20-B19). The B-chain 
features a single oc-helix from B9 to B19 followed by a 
turn and a j?-strand from B21 to B30 (termed the T- 
state; Fig. 1). In the presence of phenol the B-chain's 
a-helix can extend to Bl (termed the R-state; Derew- 
enda et al. 1989; Kaarsholm et al. 1989). Insulin's 
tertiary structure is stabilised by a hydrophobic core 
composed of highly conserved amino acid residues, 
including leucine and isoleucine residues (leucine^'', 
leucine®", leucine®'^, isoleucine'^^ and leucine'^^'^), the 
A6 and All cysteines, alanine®'^* and tyrosine'^'^. 
Amino acid residues in the hydrophobic core have 
numerous non-covalent interactions with peripheral 
amino acid residues, and the overall structure is sta- 
bilised by the disulfide bonds and interchain hydrogen 
bonds. 



B30/ 




Fig. 1 Model of insulin structure (courtesy of Svend Ludvigsen, 
Insulin Research, Novo Nordisk). The model is based on the 
histidine^' 'insulin analogue structure determined by nuclear magnetic 
resonance spectroscopy (Ludvigsen et al. 1994). The structure of the 
histidine^'^insulin analogue resembles the majority of structures 
detemiined by X-ray crystallography (Smith et al. 1984) and by 
nuclear magnetic resonance spectroscopy (Ludvigsen et al. 1994). The 
purpose of the figure is to orient the reader rather than to suggest a 
specific structure for insulin. The figure was produced using the 
MOLSCRIPT program (Kiaulis 1991) 
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Secretory expression of heterologous proteins 
in 5. cerevisiae 

Yeasts are unicellular eukaryotes which have a life cycle 
with certain stages where they divide by budding (Phaflf 
1990). Yeasts belong to the fungi, lack photosynthetic 
abihties and are saprophytic or parasitic, being depen- 
dent on organic carbon sources provided by other or- 
ganisms (Phaff 1990). The yeast S. cerevisiae, also 
known as baker's or brewer's yeast, has played a con- 
siderable role in food production for several thousand 
years. This is due to its highly specialised metabolism for 
alcoholic fermentation, converting mono- and some di- 
saccharides into carbon dioxide and alcohol by means of 
enzymes. The extensive use of S. cerevisiae by mankind 
for the preparation of wine, beer and bread has lead it to 
be regarded as a GRAS organism (generally regarded as 
safe). 

Over the past 50 years, S. cerevisiae has also be- 
come a powerful model system for biological research, 
involving numerous scientific groups in a wide range 
of fields of study. S. cerevisiae is an unicellular euk- 
aryote but shares many genes, organelles and func- 
tions with higher eukaryotes, including humans. It is 
now one of the best characterised eukaryotes. The 
entire S. cerevisiae genome has been sequenced and 
more than a thousand genes have been characterised 
(Goffeau et al. 1996). The secretory pathway of 
S. cerevisiae exhibits much of the structure and func- 
tion of the mammalian secretory system, with the 
capacity for folding, proteolytic processing, glycosy- 
lating and secreting proteins. 

In more recent years, S. cerevisiae has also been de- 
veloped as an efficient eukaryotic expression system for 
biotechnology (Smith et al. 1985: Brake 1989; Buckholz 
and Gleeson 1991; Romanes el al. 1992: Hadfield et al. 
1993; Heinisch and Hollenberg 1993). 5. cerevisiae is 
well suited for expressing heterologous proteins of sci- 
entific and commercial interest. It does not produce 
toxic compounds, and the wealth of experience from its 
extensive use in food production has made it an attrac- 
tive host for producting pharmaceutical proteins, in- 
cluding human insulin and insulin analogues. 
Development of plasmids combining efficiently regulat- 
ed or constitutive transcriptional promoters has allowed 
the expression of numerous foreign proteins in S. cere- 
visiae (Romanos et al. 1992; Hadiield et al. 1993; Hei- 
nisch and Hollenberg 1993). The production by 
S. cerevisiae of a substantial part of the insulin used 
worldwide for the treatment of diabetes is but one ex- 
ample of the importance of yeast in biotechnology. 

The generation of secretory expression systems in 
yeasts and other eukaryotic hosts involves processes that 
reach well beyond classical molecular biology. Efficient 
secretion of heterologous proteins from eukaryotic cells 
requires more than just signal peptide-mediated passage 
through the endoplasmic reticular membrane. Secretory 
expression is a complex multi-step process and many 



factors influence the export of heterologous proteins. 
Heterologous proteins need to be folded, transported 
through the secretory pathway and exported to the 
culture supernatant. 

The secretion process is multifunctional, consisting of 
translocation across the endoplasmatic reticulum mem- 
brane, attachment of N-linked carbohydrate chains and 
folding in the endoplasmatic reticulum lumen, transport 
from the endoplasmatic reticulum to the Golgi appara- 
tus, post-translational modification in the Golgi appa- 
ratus, transport by secretory granules to the cell 
membrane and finally exit to the extracellular space. 
Proteins destined for secretion typically feature a signal 
peptide at the N-terminus. In the endoplasmic reticulum, 
the signal peptide is removed by the signal peptidase, 
and primary oligosaccharides (GlnNAc2Man9Glc3) are 
covalently attached by the oligosaccharyltransferase to 
glycoproteins at consensus sites for N-linked glycosyla- 
tion (NXT/S), creating a core glycosylated protein 
(Kornfeld and Kornfeld 1985; Tanner and Lehle 1987; 
Lehle and Tanner 1995). Disulfide bonds and the ter- 
tiary structure of membrane and secretory proteins are 
formed in the endoplasmic reticulum before being 
transported to their site of action. The correct folding of 
proteins in the endoplasmatic reticulum may depend on 
the function of auxiliary proteins (such as chaperones 
and protein disulfide isomerase) which mediate folding 
and disulfide bond formation (Freedman 1989; Freed- 
man et al. 1994; Hartl 1996; Ferrari and Soling 1999). A 
vesicle system mediates transport of the protein to the 
Golgi compartment in a highly regulated fashion 
(Nothwehr and Stevens 1994; Warren and Malhotra 
1998). In the Golgi apparatus, core carbohydrate chains 
of glycoproteins are further modified by outer-chain 
glycosylation. From the Golgi compartment, the protein 
is loaded into vesicles and transported to their destina- 
tion. Genetic analyses suggest that vacuolar and secre- 
tory proteins share a common pathway to the Golgi 
apparatus where sorting occur (Schekman 1982). 

S. cerevisiae cells of mating type oe secrete a 13 -residue 
peptide pheromone (oe-factor) essential for mating with 
cells of mating type a (Thorner 1981). The ot-factor is the 
product of the MFa\ gene which encodes a 165-residue 
polypeptide (prepro-oe-factor) featuring a leader which 
consists of the 1 9-amino-acid signal (pre-) sequence, 
followed by the 66-amino-acid pro-sequence with three 
consensus sites for N-linked glycosylation and a dibasic 
Kex2 endoprotease processing site at the C-terminus, 
together with four repeats of the mature a-factor during 
secretion (Thorner 1981; Kurjan and Herskowitz 1982). 
In the prepro-a-factor, each 13-residue a-factor is pre- 
ceded by a spacer peptide of 4-6 amino-acid residues 
[(E/D)A]2_3, preceded by a dibasic processing site, ly- 
sine-arginine (KR; Thorner 1981; Kurjan and Hersko- 
witz 1982). Maturation of the a-factor requires 
processing of the prepro-a-factor by four different pro- 
teolytic enzymes (Emter et al. 1983; Julius et al. 1983, 
1984a, b; Fuller et al. 1988; Brake 1989). The signal 
peptidase cleaves the prepro-a-factor between amino 
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acid residues 19 and 20 (Waters et al. 1988). The Kex2 
endoprotease encoded by the KEX2 gene cleaves the 
glycosylated pro-a-factor after the pro-peptide and 
between the a-factor repeats (Julius et al. 1984a, b; 
Achstetter and Wolf 1985). Cleavage by the Kex2 en- 
doprotease occurs on the C-terminal side of the dibasic 
sequence KR in the pro-peptide (Julius et al. 1984a; 
Achstetter and Wolf 1985; Brake 1989). Maturation of 
the excised a-factor requires exoproteolytic processing of 
both C- and N-termini (Brake 1989). The dipeptidyl 
aminopeptidase A encoded by the STE13 gene removes 
EA or DA dipeptides from the N-terminus (Julius et al. 
1983; Fuller et al. 1988). In addition, the carboxypepti- 
dase serine protease encoded by the KEXl gene removes 
arginine (R) and lysine (K) residues from the C-terminus 
(Dmochowska et al. 1987). 

It was rapidly established that the S. cerevisiae 
a-factor leader was able to confer secretory competence 
on proteins expressed in S. cerevisiae (Emr et al. 1983; 
Bitter et al. 1984; Brake et al. 1984; Singh et al. 1984). 
Subsequently, the a-factor leader has become the clas- 
sical leader for secretory expression in S. cerevisiae, is 
generally applicable for secretory expression and has 
been used to facilitate secretion of numerous heterolo- 
gous proteins, including insulin (Bitter et al. 1984; Brake 
et al. 1984; Zsebo et al. 1986; Markussen et al. 1987; 
Thim etal. 1987, 1989; Brake 1989; Romanes et al. 
1992; Jonassen et al. 1994; Kjeldsen et al. 1996). 



Expression of insulin in S. cerevisiae 

Expression of proinsulin fused to the a-factor leader in 
S. cerevisiae did not result in efficient secretion of pro- 
insulin or insulin to the culture supernatant (Thim et al. 
1986a). However, expression of a cDNA encoding a 



proinsulin-like molecule with deletion of threonine^^'', 
followed either by replacement of the human proinsulin 
C-peptide with a small C-peptide (SK or AAK), or by 
direct fusion of lysine^^^ to glycine'^\ and fusion to the 
a-factor leader, resulted in efficient expression of these 
single-chain proinsulin-like molecules (Table 1, Fig. 2; 
Markussen et al. 1987, 1990). Likewise, Jonassen et al. 
(1994) developed an efficient S. cerevisiae secretory ex- 
pression system for insulin, based on the insulin B-chain 
residues 1—29 linked to the A-chain by short connecting 
peptides and characterised by having a dibasic process- 
ing site prior to glycine'^^ e.g. EKR. Furthermore, 
expression of a full-length B-chain connected to the 
A-chain by various mini C-peptides (e.g. RRLQKR) led 
to the secretion of a mixture of insulin and des(B30) 
insulin to the culture supernatant (Thim et al. 1986a, b, 
1987, 1989). This type of insulin precursor can be con- 
verted into insulin by digestion with trypsin (EC 3.4.2 1 .4) 
and carboxypeptidase B after purification (Thim et al. 
1987, 1989). Taken togelher these data indicate that al- 
though the S. cerevisiae secretory pathway does not have 
the ability to express human proinsulin (efficient expres- 
sion being impaired by the C-peptide), it does have the 
ability to fold and export single-chain proinsulin-like 
precursors. During transport through the secretory 
pathway, proteolytic enzymes involved in maturing the 
prepro-a-t'actor also mature the fusion protein, and 
the single-chain proinsulin-like precursor is secreted to 
the culture supernatant (iMg. 3). The fusion proteins are 
processed by the signal peptidase in the endoplasmic re- 
ticulum and by the Kex2 endoprotease in the late secre- 
tory pathway (Fig. 3). The single-chain insulin precursor 
could then be purified and subsequently converted to 
human insulin by tryptic transpeptidation in organic- 
aqueous medium in the presence of a threonine ester 
(Markussen 1987; Markussen et al. 1987). 



Table 1 Fermentation yield of insulin precursor expressed in 
Saccharomyces cerevisiae by different leaders. Insulin precursors 
comprise the first 29 ammo acids of the human insulin B-chain, 

)omed to the 21 ammo acids of the human msulm A-chain by a 
"mmi C-peptide , AAK. The msulm precursor was expressed m 
the 5'. cerevisiae stram MJbbi"^ (a gifl from Moecns trier Hansen. 
Novo Nordisk: MATalMATa pep4-^ prp4-} UIS4 Ius4 ipi::LI':L'2 
tpi LEl'2 Cir + ) iismg the POr expression plasmid iFig A) Each 
leader consists of a pre-peptide (the a-factor leader signal peptide 
or the Yap3 endoprotease signal peptide) and a pro-peptide. but 



only the pro-peptide and spacer peptides are shown here, see Fig. 2. 
Fermentation was at 30 °C for 72 h in YPD. Yield was determined 
by RP-HPLC of the culture supernatant, and is expressed as 
mean + standard deviation (SD) of the values obtained in four 
individual femientations. Where leaders lack the Kex2 en- 
liopioiLjsL sUl iLimcntation >ield was determined as des(B30) 
msulm alter maluralion by A. lyticus lysyl specific endoprotease 
(EC 3.4.21.50). Consensus N-lmked glycosylation sites [NX(T/S)] 
as well as mutated non-functional glycosylation sites (QTT) are 
underlmed 



S. cerevisicu 


' Pro- 


Pro-peptide amino acid sequence 


Spacer 


Yield 












(mg/1) 




MT748'' 




APVNTTTEDETAQIPAEAVIGYSDLEGDFDVAVLPFSNSTNNGLLFI 




16.5 ± 0.2 






NTTIASIAAKEEGVSLDKR 








yAK708 




APVNTTTEDETAQIPAEAVIGYSDLEGDFDVAVLPFSNSTNNGLLFI 


EEAEAEAEPK 


41.1 ± 


0.9 






NTTIASIAAKEEGVSMAKR 








yAK721 


LA19 


QPIDDTESNTTSVNLMADDTESRFATNTTLALDWNLISMAKR 


EEAEAEAEPK 


50.3 ± 


3.8 


yAK775 


LA34 


QTT QTT AA 


EEAEAEAEPK 


25.6"= ± 


3.9 


yAK817 


TA39 


NTT NTT-GGLDWNLISMAKR 


EEGEPK 


59.5 ± 


6.7 


yAK855 


TA57 


QTT A QTNSGGLDVVGLISMAKR 


EEGEPK 


79.2 ± 


3.7 



^An a-factor leader in which the C-terminus has been modified from "SLDKR" to "SMAKR". The LA19 pro-peptide amino acid 
sequence is shown; for the next three pro-peptides the sequence is given only at positions where it differs from the LA19 pro-peptide amino 
acid sequence 
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Fig. 2 Schematic representation of the leader-msulin precursor fusion 
protein expressed in Sacchwomrces cerevnuie. A leader consists of a 
pre-peptide and a pro-peptide. Here the 5. cerevisiae a-factor leader is 
indicated (leader). Pre indicates the pre-peptide (signal peptide) and 
Pro indicates the pro-peptide. The dihasic Kex2 endoprotease 
processing site, localised at the pro-peptides C-temiinus, is shown m 
black. The single-chain insulin precursor (IP), comprising the first 29 




amino acids of the human insulin B-chain (B1-B29) is joined to the 21 
amino acids of the human insulin A-chain (A1-A21) by the mini 
C-peptide AAK (C) connecting lysine^"^ and glycine*^'. Certain fusion 
proteins also feature a spacer peptide iS) with the amino acid sequence 
(EEAEAEAEPK, or derivatives of this sequence). Sites for attaching 
N -linked carbohydrate chains, for processing by the signal peptidase 
and for processing by the Kex2 endoprotease are indicated 



The S. cerevisiae a-factor leader has three consensus sites 
for the attachment of N-hnked carbohydrate chains. 
N-linked glycosylation is important, but not essential, 
for the abihty of the (x-factor leader to secrete a-factor 
(Julius et al. 1984b; Caplan et al. 1991). The a-factor 
pro-peptide's three N-linked carbohydrate chains are 
also important for the secretion of the insulin precursor. 
Mutation of all three sites by changing the asparagine to 
a glutamine decreased the quantity of secreted insulin 
precursor to 13% of that obtained using the a-factor 
pro-peptide with three functional N-linked glycosylation 
sites. However, there was a clear difference in the relative 
importance of the three sites with regard to their influ- 
ence on insulin precursor secretion. The two N-linked 
carbohydrate chains closest to the insulin precursor were 
significantly more important than the third site for the 
ability to facilitate the secretion of the insuUn precursor, 
and there was no synergistic effect of the N-linked 



oligosaccharide chains with respect to secretion (Kjeld- 
sen et al. 1998a). 

Insulin was expressed in S. cerevisiae utilising a 
synthetic gene encoding the fusion protein a-factor- 
leader-insuUn precursor inserted into a S. cerevisiae- 
Escherichia coli shuttle plasmid, the POT plasmid 
(Fig. 4), which is based on the S. cerevisiae 2 |xm 
plasmid (Markussen et al. 1987; MacKay et al. 1990; 
Kawasaki and Bell 1999). Transcription of the syn- 
thetic fusion protein gene was controlled by the 
S. cerevisiae triose phosphate isomerase gene (TPIl) 
promoter and terminator (Fig. 4; Alber and Kawasaki 
1982; Thim et al. 1986a; Markussen et al. 1987; 
MacKay et al. 1990; Kawasaki and Bell 1999). Auto- 
selection of the POT expression plasmid in S. cerevi- 
siae was based on the Schizosaccharomyces pombe TPI 
encoded by the POT gene. S. cerevisiae strains [like 
MT663 (MATa/MAToi pep4-3/pep4-3 HIS4/his4 



Fig. 3 Schematic representa- 
tion of the 5. cerevisiae secre- 
tory pathway, showing 
transport, processing and ex- 
port of the a-factor leader in- 
sulin precursor fusion protein. 
Endoplasmic reticulum (ER), 
Golgi apparatus (GOLGP), vac- 
uole (V), secretory vesicles 
(SV), a-factor pre-peptide (pre), 
a-factor pro-peptide (pro) and 
human insulin precursor (IP) 
are indicated. Attachment and 
modification of the a-factor 
pro-peptide's three N-linked 
carbohydrate chains, as well as 
processing, both by the signal 
peptidase and by the Kex2 
endoprotease, are also shown 



ER 



GOLGi 



SV 



V 
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TPi-TERM!MATOR 



11228 hp 



/ 



Fig. 4 The pAK721 S. cerevisiae expression plasmid expressing the 
LA19 leader-EEAEAEAEPK-insulin precursor fusion protein. The 
pAK721 expression plasmid was based on the S. cerevisiae- 
Escherichia coli shuttle POT plasmid (MacKay et al. 1990; Kawasaki 
and Bell 1999). Z,-/P indicates the fusion protein expression cassette, 
TPI-PROMOTER the 5. cerevisiae TPIl promoter and TPI- 
TERMINATOR the 5. cerevisiae TPIl terminator. TPI-POMBE 
indicates the 5. pombe POT gene used for selection in 5. cerevisiae; 
Oi?/G/iV indicates a 5. cerevisiae origin of replication derived from the 
2-|jm plasmid. AMP-R indicates the ^-lactamase gene conferring 
resistance toward ampicillin and facilitating selection in E. coli. 
ORIGIN-PBR322 indicates an E. coli origin of replication 



tpi::LEU2/tpi::LEU2 Cir'*')] with a non-reverting dele- 
tion/disruption in the TPIl chromosomal gene encoding 
TPI grow poorly when glucose is the only carbon source 
(MacKay et al. 1990; Kawasaki and Bell 1999). Trans- 
formation of Atpil S. cerevisiae strains (like the MT663 
strain) with the POT expression plasmid allows selection 
by the ability to grow on glucose in rich medium 
(MacKay et al. 1990; Kawasaki and Bell 1999). The 
Sch. pombe POT gene is poorly expressed in S. cerevi- 
siae and multiple copies of the POT plasmid are re- 
quired to generate sufficient gene product to allow 
growth on glucose as the sole carbon source (MacKay 
et al. 1990; Kawasaki and Bell 1999). However, the 
TPIl promoter is constitutive and powerful, and tran- 
scription of synthetic genes on high-copy plasmids by 
this promoter may induce down-regulation of the pla- 
smid copy number. Such plasmid down-regulation may 
influence the quantity of secreted heterologous protein. 
The POT plasmid expression system has been success- 
fully scaled up (Diers et al. 1991) and used for stable 
long-term continuous cultures with high production 
levels. The combination of a powerful constitutive 
glycolytic promoter, autoselection to stabiHse the ex- 
pression plasmid and growth in rich medium, an insulin 
molecule specifically modified for secretion, and long- 
term large-scale continuous fermentation has established 
the ability of S. cerevisiae to produce pharmaceutically 
important polypeptides, with the required quality, 
authenticity and quantity. 



Development of spacer peptides for optimisation 
of Kex2 endoprotease processing of tlie pro-peptide- 
insulin precursor fusion protein and concomitant 
increase of the yield of the insulin precursor 

In some early attempts to use the a-factor leader for the 
secretion of heterologous proteins in S. cerevisiae (e.g. 
interferon-al, jS-endorphin and human epidermal 
growth factor) the a-factor spacer peptide [(E/D)A]2_3 
was retained between the dibasic Kex2 endoprotease site 
and the heterologous protein (Bitter et al. 1984; Brake 
et al. 1984; Singh et al. 1984; Zsebo et al. 1986; Piggott 
et al. 1987). However, removal of the spacer peptide 
from the heterologous proteins by the dipeptidyl amino- 
peptidase A was found to be inefficient. Also expression 
of an a-factor leader EAEA-insulin precursor fusion 
protein led to the secretion of a molecule N-terminally 
extended by either EAEA or EA (Thim et al. 1986a). In 
general, removal of EA di peptides from the heterolo- 
gous protein by the dipeptidyl aminopeptidase A 
encoded by the STE13 gene has proved to be incom- 
plete. Moreover, expression of the MFal gene on a 
high-copy plasmid resulted in the secretion of incom- 
pletely processed a-factor similar to that produced by 
stel3 S. cerevisiae mutants (Julius et al. 1983). These 
data suggested that the quantity of dipeptidyl amino- 
peptidase A present in the secretory pathway was 
insufficient to process the amount of synthesised 
heterologous protein expressed by synthetic genes on 
high-copy plasmids. Consequently, spacer peptides were 
generally avoided and heterologous proteins, including 
insulin, were fused directly to the leader (Markussen 
et al. 1987; Brake 1989; Markussen et al. 1990). How- 
ever, expression of fusion protein without a spacer 
peptide might result in the secretion of unprocessed and 
partially processed fusion protein, indicating that the 
spacer peptide may facilitate Kex2 endoprotease activity 
(Zsebo et al. 1986; Piggott et al. 1987; Brake 1989). 
Pulse-chase analysis of an a-factor leader-insulin pre- 
cursor fusion protein without a spacer peptide expressed 
in 5. cerevisiae showed secretion of hyperglycosylated 
fusion protein to the culture supernatant (Kjeldsen et al. 
1996). Thus, Kex2 endoprotease processing of the pro- 
peptide-insuHn precursor fusion protein was incomplete 
and a significant part of the pro-peptide-insulin 
precursor fusion protein was secreted to the culture 
supernatant in a hyperglycosylated form (Fig. 3; 
Kjeldsen et al. 1996). 

To improve Kex2 endoprotease processing of the 
fusion protein, a charged spacer peptide was introduced 
between the pro-peptide and the insuUn precursor 
(Kjeldsen et al. 1996). However, this resulted in the se- 
cretion of an insulin precursor molecule which was N- 
terminally extended. To facilitate in vitro removal of the 
spacer peptide from the insulin precursor, a lysine (K) 
residue was introduced in the spacer peptide C-terminus 
before phenylalanine^ ^ Transpeptidation (Markussen 
1987) of the N-terminally extended insulin precursor. 
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either by trypsin (EC 3.4.21.4) or by Achromohacter 
lyticus lysyl-specific endoprotease (EC 3.4.21.50), re- 
moved both the spacer peptide and the "connecting- 
peptide" as well as adding a threonine residue to ly- 
sine^^^, to generate human insulin. 

Introduction of a spacer peptide (e.g. EE- 
AEAEAPK), based on the prepro-a-factor [(E/D)A]2-3 
spacer peptide after the dibasic Kex2 endoprotease site 
and before the phenylalanine^^ of the insulin precursor, 
significantly increased the fermentation yield of the in- 
sulin precursor expressed in the S. cerevisiae (Table 1; 
Kjeldsen et al. 1996). Concomitantly, the quantity of 
hyperglycosylated pro-peptide-insulin precursor fusion 
protein secreted to the culture supernatant decreased. 



Development and characterisation of leaders 
(prepro-peptide) for secretoiy expression of 
the insulin precursor in 5. cerevisiae 

The pro-peptide was essential for export of the insulin 
precursor expressed in S. cerevisiae (Markussen et al. 
1987, 1990; Kjeldsen et al. 1999b). Consequently, de- 
velopment of leaders and especially pro-peptides has 
been a key target for optimising the fermentation yield 
of the insulin precursor expressed in yeast. Semi-random 
mutation has been used to develop a number of con- 
structed (synthetic) leaders for secretory expression of 
the insulin precursor in S. cerevisiae (Kjeldsen et al. 
1997) and some of these are listed in Table 1. The con- 
structed leaders were equal to or better than the S. ce- 
revisiae a-factor leader at facilitating export of the 
insulin precursor to the culture supernatant (Table 1). 
Surprisingly, the constructed leaders did not depend on 
N-linked glycosylation for their ability to facilitate se- 
cretion of the insulin precursor (Table 1; Kjeldsen et al. 
1998a). The secretory capacity of constructed leaders 
which lack N-linked glycosylation sites is illustrated by 
the TA57 leader which, combined with the spacer pep- 
tide EEGEPK, increased the fermentation yield of in- 
sulin precursor to 480% relative to the a-factor leader 
insulin precursor fusion protein (Table 1). 

Secreted fusion protein based on a constructed leader 
which lack both a dibasic processing site and N-linked 
glycosylation was readily processed into des(B30) insulin 
by the addition of A. lyticus lysyl specific endoprotease 
(EC 3.4.21.50) to the culture supernatant (Table I; 
Kjeldsen et al. 1998b). The Kex2 endoprotease catalytic 
efficiency toward the fusion protein can be a limiting 
factor in yeast secretory expression. In vitro maturation 
of a purified fusion protein by an alternative enzyme 
would eliminate limitations on the fermentation yield 
imposed by the Kex2 endoprotease and would also 
prevent N-terminal processing of heterologous proteins 
by dipeptidyl aminopeptidase A. Constructed leaders 
without hyperglycosylation provide an opportunity for 
the purification of secreted fusion protein and subse- 
quent in vitro maturation with a suitable enzyme 
(Kjeldsen et al. 1998b). 



Secretion kinetics of the insulin precursor expressed 
in 5. cerevisiae 

Secretion kinetics of the leader-insulin precursor fusion 
protein was investigated by pulse-chase labelling 
(Kjeldsen et al. 1999b). Secretion was rapid, with the 
first insulin precursor appearing in the culture superna- 
tant within 2-4 min of the chase. The overall kinetic 
profiles for insulin precursor secretion mediated by dif- 
ferent leaders were comparable, indicating a similar in- 
tracellular transport and processing of these fusion 
proteins in the S. cerevisiae secretory pathway. The 
majority of secreted insulin precursor appeared in the 
culture supernatant within the first 15 min (Kjeldsen 
et al. 1999b). Furthermore, the rapid secretion of an 
insulin precursor with disulfide bonds identical to that of 
human insulin indicated a rapid disulfide bond forma- 
tion and folding in the S. cerevisiae endoplasmic reti- 
culum. 

After 2.5 min of metabolic labelling of a leader in- 
sulin precursor fusion protein, the majority was core- 
glycosylated on one or both of the pro-peptide N- 
linked glycosylation sites, indicating that the majority 
of the fusion protein was present either in the endo- 
plasmic reticulum or on its way to the Golgi apparatus 
(Kjeldsen et al. 1999b). This indicated translocation 
and stabilisation of the fusion protein in the endo- 
plasmic reticulum mediated by the leader. Some ex- 
tended carbohydrate chains appeared to be present on 
the pro-peptide, and enzymatic processing of the fusion 
protein by the Kex2 endoprotease located in a late 
Golgi compartment (Cunningham and Wickner 1989; 
Wilcox et al. 1992; Redding et al. 1996a, b) had already 
occurred to a minor degree. Therefore, after 2.5 min a 
minor fraction of the fusion protein had already 
reached a late Golgi apparatus compartment. After 
35 min of the chase, only 2-A% of the insulin precursor 
was still present intracellularly as fusion protein. 
However, after 35 min approximately 30% of the 
metabolically labelled insulin precursor was still present 
as intracellular insulin precursor (Kjeldsen et al. 
1999b). The fusion protein was almost completely 
proteolytically processed to insulin precursor by the 
Kex2 endoprotease. Therefore, the majority of this in- 
sulin precursor fraction was located in a Golgi or post- 
Golgi compartment. The intracellularly retained insulin 
precursor might have been sorted to the vacuole. 
Moreover, the selective intracellular retention of the 
insulin precursor relative to the fusion protein indicates 
that retention was associated with the biophysical/bio- 
chemical properties of the insuUn precursor. Intracel- 
lular retention of a substantial amount of the 
synthesised insuHn precursor indicated that the insulin 
precursor followed two different intracellular routes 
and that secretion to the culture supernatant reflected 
either the saturation of a Golgi or post-Golgi com- 
partment due to overexpression, or secretion occurring 
in competition with intracellular retention. 
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Prevention of folding of the insulin precursor, 
whether by mutation or by preventing disulfide bond 
formation (using dithiothreitol), resulted in degradation 
and intracellular retention (Kjeldsen et al. 1999b). 
Misfolding or partial folding of the insulin precursor 
and consequent exposure of the insulin's hydrophobic 
core increased the hydrophobic surfaces of the mole- 
cule, and this might provide the signal for intracellular 
retention. 

The insulin precursor has also been expressed in the 
methylotrophic yeast Pichia pastoris (Kjeldsen et al. 
1999a). Proinsulin and the insulin precursor were ex- 
pressed in P. pastoris using different leaders to confer 
secretory competence. Expression of an a-factor leader 
proinsulin fusion protein in P. pastoris did not result in 
the secretion of proinsulin. However, shortening the C- 
peptide, deleting threonine^^'^ and associating with the 
S. cerevisiae a-factor leader, as described for secretory 
expression in S. cerevisiae (Markussen et al. 1987, 1990), 
resulted in the efficient secretion of the insulin precursor 
in P. pastoris. Presumably, a P. pastoris endopeplidase 
functionally equivalent to the S. cerevisiae Kex2 endo- 
protease cleaved the fusion protein. Constructed leaders 
developed for secretory expression in S. cerevisiae also 
facilitated secretion of the insulin precursor in P. pasto- 
ris. As in S. cerevisiae, constructed leaders without 
N-linked glycosylation sites facilitated secretion of the 
insulin precursor expressed in P. pastoris. Furthermore, 
N-linked glycosylation was important for the ability of 
the a-factor pro-peptide to facilitate efficient transport 
and secretion of the insulin precursor. These data indi- 
cate that P. pastoris and S. cerevisiae have similar overall 
characteristics regarding the expression and secretion of 
insulin. 




Steadily increasing requirements for insulin for treating 
the growing numbers of patients with diabetes mellitus 
have encouraged the search for efficient biotechnological 
expression systems for the production of human insulin. 
Since 1987, genetically engineered human insulin has 
been produced in the yeast S. cerevisiae. This simple 
eukaryote is remarkably efficient at expressing and se- 
creting insulin, and it offers high productivity. However, 
it was necessary to adapt the insulin molecule to secre- 
tory expression in yeast by deleting threonine^^'' and 
replacing human proinsulin's C-peptide with a small C- 
peptide, together with fusion to a leader. The leader 
appears to mediate the introduction of the insulin pre- 
cursor into the endoplasmic reticulum and its stabilisa- 
tion in the endoplasmic reticulum, emphasising the 
importance of the leader for secretory expression. 
Autoselection plasmid systems, as described here, allow 
for selection in rich medium and large-scale continuous 
fermentation and illustrate the capacity of the yeast 
S. cerevisiae for producing pharmaceutically important 
proteins. This points toward further development of 



autoselection plasmid systems and chromosomal inte- 
gration of expression plasmids to ameliorate the secre- 
tory expression of recombinant proteins in S. cerevisiae, 
as regards both quantity and authenticity. 
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